34 research outputs found

    Automated Alignment in Multilingual Corpora

    Get PDF

    Adjective Density as a Text Formality Characteristic for Automatic Text Classification: A Study Based on the British National Corpus

    Get PDF
    PACLIC 23 / City University of Hong Kong / 3-5 December 200

    Enhanced Genre Classification through Linguistically Fine-Grained POS Tags

    Get PDF

    How Well Conditional Random Fields Can be Used in Novel Term Recognition

    Get PDF

    eSpaceML: An Event-Driven Spatial Annotation Framework

    Get PDF

    Unsupervised Classification of Biomedical Abstracts using Lexical Association

    Get PDF

    Latin Etymologies as Features on BNC Text Categorization

    Get PDF
    PACLIC 23 / City University of Hong Kong / 3-5 December 200

    A Corpus-Based Quantitative Study of Nominalizations across Chinese and British Media English

    Get PDF

    Improving Automated Alignment in Multilingual Corpora

    Get PDF

    Gene prioritization of resistant rice gene against Xanthomas oryzae pv. oryzae by using text mining technologies

    Get PDF
    To effectively assess the possibility of the unknown rice protein resistant to Xanthomonas oryzae pv. oryzae, a hybrid strategy is proposed to enhance gene prioritization by combining text mining technologies with a sequence-based approach. The text mining technique of term frequency inverse document frequency is used to measure the importance of distinguished terms which reflect biomedical activity in rice before candidate genes are screened and vital terms are produced. Afterwards, a built-in classifier under the chaos games representation algorithm is used to sieve the best possible candidate gene. Our experiment results show that the combination of these two methods achieves enhanced gene prioritization
    corecore